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Abstract 

Background: Numerous clinical tests are used in the diagnosis of anterior cruciate ligament (ACL) injury but their 
accuracy is unclear. The purpose of this study is to evaluate the diagnostic accuracy of clinical tests for the 
diagnosis of ACL injury. 

Methods: Study Design: Systematic review. The review protocol was registered through PROSPERO 
(CRD4201 2002069). 

Electronic databases (PubMed, MEDLINE, EMBASE, CINAHL) were searched up to 19th of June 2013 to identify 
diagnostic studies comparing the accuracy of clinical tests for ACL injury to an acceptable reference standard 
(arthroscopy, arthrotomy, or MRI). Risk of bias was appraised using the QUADAS-2 checklist. Index test accuracy 
was evaluated using a descriptive analysis of paired likelihood ratios and displayed as forest plots. 

Results: A total of 285 full-text articles were assessed for eligibility, from which 14 studies were included in this 
review. Included studies were deemed to be clinically and statistically heterogeneous, so a meta-analysis was not 
performed. Nine clinical tests from the history (popping sound at time of injury, giving way, effusion, pain, ability to 
continue activity) and four from physical examination (anterior draw test, Lachman's test, prone Lachman's test 
and pivot shift test) were investigated for diagnostic accuracy. Inspection of positive and negative likelihood ratios 
indicated that none of the individual tests provide useful diagnostic information in a clinical setting. Most studies 
were at risk of bias and reported imprecise estimates of diagnostic accuracy. 

Conclusion: Despite being widely used and accepted in clinical practice, the results of individual history items or 
physical tests do not meaningfully change the probability of ACL injury. In contrast combinations of tests have 
higher diagnostic accuracy; however the most accurate combination of clinical tests remains an area for future 
research. 

Clinical relevance: Clinicians should be aware of the limitations associated with the use of clinical tests for 
diagnosis of ACL injury. 

Keywords: Anterior cruciate ligament. Diagnosis, Medical history taking. Physical examination. Diagnostic test 
accuracy 
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Background 

The anterior cruciate ligament (ACL) is an important 
stabilising structure of the knee and its disruption is as- 
sociated with pain and activity limitation [1]. The annual 
incidence of ACL injury ranges from 0.01% to 0.05% [2], 
however it is higher in sporting groups and most fre- 
quently affects individuals during late adolescence and 
early adulthood [3-5]. The prevalence of ACL injury in 
adults presenting to primary care with acute knee pain is 
estimated to be 4% [6]. Many cases are initially missed 
[7] in primary care and these undiagnosed ACL injuries 
are of concern because of the risk of cartilage tear and 
premature knee osteoarthritis [8]. 

Clinical diagnosis of ACL injury is based upon history 
and physical examination findings with suspected cases 
confirmed by MRI or arthroscopy [9]. Numerous clinical 
tests and findings have been proposed to aid the diagno- 
sis of ACL injury. A popping sound, swelling and in- 
stability following high impact sport trauma along with a 
positive Lachmans, anterior draw or pivot shift test is 
the most common method of clinical diagnosis [9]. How- 
ever, there are over 25 specific physical tests and numerous 
features from the clinical history that have been proposed 
for detection of ACL injury [10]. At present the diagnostic 
accuracy of these tests is unclear. 

Most existing reviews evaluating the accuracy of tests 
to diagnose ACL injury [6,11-14] are now over a decade 
old and contain methodological limitations such as in- 
clusion of inappropriate studies and pooling of estimates 
from heterogeneous studies. Since these reviews were 
published there has been much progress in the diagnos- 
tic field with regard to study appraisal and synthesis 
[15]. There is now a greater appreciation of how design 
features may lead to biased estimates of diagnostic test 
accuracy and when meta-analysis is justified. In addition 
it is likely that more recent primary research studies 
have been conducted in the area of ACL diagnosis. 

The objective of this systematic review was to report 
the diagnostic accuracy of clinical tests for the diagnosis 
of ACL injury and describe the quality of research evalu- 
ating these tests. 

Methods 

A systematic review protocol [16] was registered at the 
International Prospective Register of Systematic Reviews - 
PROSPERO 2012:CRD42012002069. 

Identification of selected studies 

Electronic databases (PubMed, MEDLINE, EMBASE, and 
CINAHL) were searched for eligible diagnostic studies from 
the earliest year possible up to 19th of June 2013. The 
search strategy was developed for PubMed and modified 
for use in other databases (Additional file 1: Table SI). The 
reference lists of all included publications and relevant 



systematic reviews were checked and forward citation 
searches performed. 

Eligibility criteria 

Diagnostic studies were eligible if they compared the ac- 
curacy of history taking or physical examination to an 
acceptable reference standard (arthroscopy, arthrotomy, 
or MRI) in the identification of ACL injury. Both pro- 
spective and retrospective studies were eligible for inclu- 
sion. We did not include case control studies as they 
substantially overestimate diagnostic accuracy compared 
with studies that use a clinical population [17]. 

The focus of this review was on studies that evaluated 
patients presenting to a care provider for diagnosis of 
knee pain or dysfunction, where the diagnostic accuracy 
of individual, or combinations of, history features or 
physical assessment procedures was evaluated. Studies in 
which a substantial proportion of recruited patients had 
already been diagnosed with ACL injury were excluded 
to minimise verification bias [17]. 

Included studies had to report sufficient data on diagnos- 
tic tests to enable construction of a 2 x 2 table so estimates 
of diagnostic accuracy (such as sensitivity and specificity) 
could be calculated. Studies that evaluated the accuracy of 
an unspecified combination of history and physical examin- 
ation, such as clinical diagnosis or global clinician judgment 
were excluded as they do not allow for replication, valid- 
ation and generalization of the study results [18]. 

If studies had been reported in abstracts or conference 
proceedings, the related full publications were retrieved 
if available, but only full articles published in peer- 
reviewed journals were included. Studies published in all 
languages were considered eligible and translations were 
sought where necessary. 

Study selection 

Two authors (MS and NH) independently screened all 
titles and abstracts identified in the searches with respect 
to the inclusion and exclusion criteria. Full text copies of 
potentially relevant articles were retrieved and final in- 
clusion or exclusion was determined. Disagreements re- 
garding inclusion were resolved by consensus, including 
a third review author (SK) where necessary. 

Data extraction 

Three review authors (MS, NH, SK) independently ex- 
tracted information from the included studies. Data were 
extracted into a specifically designed spreadsheet and in- 
cluded details on the study design, setting, enrolment 
procedures, number of participants, patient demograph- 
ics, and time since initial ACL injury. Details of the type 
of index test and the type of reference standard were 
also extracted and the proportion of participants with 
ACL injuries was calculated for each included study. 
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Diagnostic two-by-two tables (true positive, false positive, 
true negative and false negative) were either extracted from 
the publications or reconstructed using information from 
other reported parameters (sensitivity, specificity, or pre- 
dictive values). Uninterpretable index test outcomes, such 
as an equivocal finding were dealt with as a negative index 
test finding. The authors of one study [19] were contacted 
and provided additional information. 

Quality assessment 

The quality of each included study was assessed by two 
review authors (MS, NH) using the QUality Assessment 
of Diagnostic Accuracy Studies (QUAD AS -2) checklist 
[20]. The QUADAS-2 checklist consists of four domains 
relating to patient selection, index test, reference stand- 
ard, and flow and timing. Each domain is assessed in 
terms of risk of bias, and the first 3 domains are also 
assessed in terms of applicability. The review authors 
classified each item as "yes" (adequately addressed), "no" 
(inadequately addressed), or "unclear" (inadequate detail 
presented to allow a judgment to be made). Disagree- 
ments were resolved by consensus and consulting with a 
third (SK) review author where necessary. 

Synthesis of results 

The two-by-two tables were used to calculate index test 
summary statistics: sensitivity, specificity, likelihood ra- 
tios along with their 95% confidence intervals using 
MetaDiSc 1.4. Index test accuracy was presented as for- 
est plots of likelihood ratios, as likelihood ratios provide 
the best way for clinicians to use diagnostic data to es- 
tabUsh clinical diagnoses in patient care [21]. Categorisa- 
tion of likelihood ratios was adopted from Jaeschke et al. 
[18] where positive likelihood ratios (+LR) <5 and a 
negative likelihood ratios (-LR) >0.2 were considered small, 
+LR 5-10 and -LR 0.1-0.2 were moderate, and + LR>10 
and -LR <0.1 were considered large, with respect to chan- 
ging the pre to post-test probability. 

Both clinical and statistical heterogeneity as well as 
methodological quality were evaluated to determine the 
appropriateness of meta-analysis. Assessment of clinical 
heterogeneity involved comparison of the study popula- 
tions, settings, performance of index tests and reference 
standards among included studies. Assessment of statis- 
tical heterogeneity involved visual inspection of forest 
plots and performance of the chi-square (x^) test and 
calculation of the inconsistency index (I^) which quantifies 
the proportion of variation across the included studies that 
is due to heterogeneity rather than chance [22]. 

Results 

Study selection 

The initial database searches retrieved 21,691 citations 
of which 10,796 citations remained after duplicates were 



removed (Figure 1). Screening of the titles and abstracts 
identified 285 potentially relevant articles that were re- 
trieved in full text format. Forwards and backwards cit- 
ation tracking identified 12 potentially relevant articles 
which were also retrieved. Fourteen articles were finally 
included, of which 11 were published in English [19,23-32] 
and three in German [33-35]. Additional file 2: Table S2 
lists the reasons for excluding 28 articles that were included 
in one or more of the previous five systematic reviews. 

Description of included studies 

Of the 14 included articles, 10 had a prospective study 
design [19,23,24,28-30,32-35], two used a retrospective 
design [26,27] and for two studies [25,31] the design was 
unclear (Table 1). 

Only one study [19] evaluated the diagnostic accuracy 
of clinical tests in primary care. The other 13 studies 
evaluated the accuracy of clinical tests in secondary con- 
tact settings, defined here as either a referral to an 
orthopaedic department or presentation to an accident 
and emergency department. In three studies the refer- 
ence standard was MRI [19,24,27], eight studies applied 
arthroscopy [23,26,29-32,34,35] and three studies ap- 
pUed either arthroscopy or arthrotomy [25,28,33]. Only 
five studies [25-27,29,30] reported in detail the method 
of index test application with slight variations between 
them in the way the index tests was performed. 

Nine studies [19,23,26,28,30-32,34,35] assessed diag- 
nostic accuracy for partial or complete ACL injuries, 
however only four of these [19,23,30,32] provided suffi- 
cient information to determine if the index test result 
pertained to a partial or complete disruption of the ACL. 
Injury severity (partial or complete ACL disruption) was 
unclear and treated as partial and complete injuries in the 
remaining studies. Nine studies [19,24-26,28,30,31,33,35] 
described ACL injuries with concomitant injury to 
other knee structures, while comorbid knee injuries 
were unclear or not reported in the remaining five 
studies [23,27,29,32,34]. 

There was variability between participants in the in- 
cluded studies with respect to sample size (50-350), 
average age (25-40 years), proportion of males (52%- 
100%) and time since ACL injury (one day to longer 
than one year). The prevalence of verified partial and 
complete ACL injury ranged from 21%-81%. 

Quality assessment 

The QUADAS-2 ratings of risk of bias and study applic- 
ability are shown in Table 2. Only one study [19] ad- 
equately addressed all risk of bias domains. For the 14 
studies, risk of bias was high or unclear with regard to 
patient selection for 10 studies, for the index text four 
studies, for the reference standard nine studies and for 
flow and timing eight studies. 
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Records identified through 
database searching 
(n =21,691) 




Additional records identified 
through citation tracking 
(n = 12) 










Records after duplicates removed 
(n = 10,796) 









Records screened 
(n= 10,808) 



Records discarded 
(n =10,523) 



Full-text articles assessed 
for eligibility 
(n = 285) 



Studies included in 
descriptive synthesis (n = 14) 



Full-text articles 
excluded: 

Type of study n=79 
Participants n=52 
Index tests n=140 



Investigations included 
quantitative synthesis 
[meta-analysis] 
(n = 0) 



Figure 1 PRISMA flow diagram of studies through the review. 



Only one study [19] clearly stated that the reference 
standard was assessed without knowledge of the results 
of the index test, while in 12 studies this was unclear 
[23-26,28-35]. In one study the reference standard was 
not applied independently of clinical tests [27]. Six stud- 
ies [24,27,28,32,34,35] included all enrolled participants 
in the analysis. Across the remaining eight studies 
[19,23,25,26,29-31,33] the number of participants left 
out of the analyses ranged from 1%-71% of those origin- 
ally included. 

Diagnostic accuracy of clinical tests 

A total of nine clinical index tests were identified by this 
review. Five tests were items from the clinical history 
(popping sound at time of injury, giving way, effusion, 
pain, ability to continue activity) and four index tests 
were applied as part of physical assessment (the anterior 
draw test, Lachmans test, prone Lachmans test, the 
pivot shift). Three of the tests were also performed 
under anaesthesia (anterior draw test, Lachmans test, 
pivot shift test). Diagnostic accuracy statistics for all 
index tests are presented as supplemental material 
(Additional file 3: Table S3). The anterior draw, 
Lachman and pivot shift tests were each evaluated in 
subgroups where the tests were applied in secondary 
contact settings to identified partial and complete ACL 
injury. The chi-square test ranged from = 50.66, 6df, 



P< 0.001 to x^ = 6.55, 4df, P = 0.16 and the inconsist- 
ency indexes were typically high (>75%) ranging from 
99.2% to 38.9%. The three physical tests plotted on the 
ROC plane as well the subgroups sensitivity and specificity 
forest plots are presented as supplementary information 
(Additional file 4: Figure SI, Additional file 5: Figure S2). 
The variability in patient spectrum and performance of 
index tests among the included studies resulted in import- 
ant clinical and statistical heterogeneity. In addition, only a 
small number of studies evaluate specific clinical tests, with 
all but one study at high risk of bias, so a decision was 
made not to perform a meta-analysis. The diagnostic accur- 
acy of individual clinical tests for ACL injury along with 
thresholds for defining clinical usefulness (i.e. small, moder- 
ate and large change in post-test probability) are illustrated 
in Figure 2. The number of studies that evaluated each indi- 
vidual test ranged from two studies for clinical history items 
to nine studies for Lachmans test. 

Only two studies [19,30], from different settings (pri- 
mary and secondary care), investigated test accuracy for 
clinical history items. Clinical history items had low 
value in correctly diagnosing ACL injury (+LR range 
0.93-2.54, -LR range 0.15-1.18) (Figure 2). 

Six studies [19,23,24,27,31,34] reported the accuracy of 
the anterior draw test in diagnosing ACL injury. Small, 
moderate and large +LR (range 1.94-87.88) were ob- 
served for the anterior draw test across studies. The 
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Table 1 Characteristics of included studies 



First author, 
year 


Design 


Setting 


Participants 


Partial and 
complete ACL 
tear prevalence 
% (n) 


Complete ACL 
tear prevalence 

% (n) 


Reference 
standard(s) 


Beldame, 
201 1 [Z5\ 


Prospective 


University hospital, France. 


*1 12 patient/knees with an 
indication for knee arthroscopy. 


37.5% (42) 


28.5% (32) 


Arthroscopy 


Boeree, 1991 
[24] 


Prospective 


Orthopaedic clinic, UK. 


203 patient/knees referred from 
GPs or the A&E. 


29.1% (51) 


nr 


MRI 


Decker, 1988 
[33] 


Prospective 


Hospital, Germany. 


tl08 patient/knees suspected 
to have knee ligament injury. 


61.1% (66) 


nr 


Arthroscopy/ 
Surgery 


Harilainen, 

1987 [25] 


Unclear 


Emergency department, 
Finland. 


1350 patient/knees with acute 
knee injury. 


41.7% (146) 


nr 


Arthroscopy/ 
Arthrotomy 


Katz, 1986 [26] 


Retrospective 


Community hospital, USA. 


85 participant/knees with knee 
injuries presenting for arthroscopy. 


25.9% (22) 


nr 


Arthroscopy 


Lee, 1988 [27] 


Retrospective 


Orthopaedic department 
of a hospital, USA. 


79 magnetic resonance studies 
of the knee were reviewed. 


29.1% (22) 


nr 


MRI 


Lucie, 1984 
[28] 


Prospective 


Orthopaedic clinic, USA. 


50 patient/knees with acute 
traumatic knee haemarthrosis. 


76.0% (38) 


nr 


Arthroscopy/ 
Arthrotomy 


Mulligan, 2011 
[29] 


Prospective 


Orthopaedic surgery and 
sports medicine service, 
USA. 


*t52 patient/knees with a 
complaint of knee pain referred 
from emergency department. 


44.2% (23) 


nr 


Arthroscopy 


Noyes, 1980 
[30] 


Prospective 


Orthopaedic/Sports 
medicine knee clinic, USA. 


*85 injured knees (83 patients) 
that had traumatic haemarthrosis. 


71.8% (61) 


43.5% (37) 


Arthroscopy 


Richter, 1996 
[34] 


Prospective 


Hospital, Germany. 


74 patient/knees with effusion 
of the knee following trauma. 


78.4% (58) 


64.9% (48) 


Arthroscopy 


Schwartz, 
1997 [35] 


Prospective 


Hospital, Germany. 


58 patient/knees with acute 
knee injury. 


81.0% (47) 


65.5% (38) 


Arthroscopy 


Tonino, 1986 
[31] 


Unclear 


Hospital, Netherlands. 


*66 patient/knees with acute 
symptoms of a ligamentous 
lesion of the knee after trauma. 


45.5% (30) 


nr 


Arthroscopy 


Wagemakers, 
2010 [19] 


Prospective 


GP clinics, Netherlands. 


*134 patient/knees with 
new knee symptoms. 


20.9% (28) 


12.7% (17) 


MRI 


Wong, 1999 
[32] 


Prospective 


Orthopaedic department of 
a hospital. Hong Kong. 


91 patient/knees with an acute 
knee haemarthrosis. 


nr 


56.0% (51) 


Arthroscopy 



nr: not reported. 

*Not all participants evaluated by index test(s). 
tNot all participants evaluated by reference standard. 



large +LR estimates all had wide confidence intervals and 
were reported in studies with high risk of bias. All -LRs 
(range 0.23-074) for the anterior draw test were within 
the small threshold. 

Nine studies [19,23-25,27,3132,34,35] investigated the 
accuracy of Lachmans test in diagnosing ACL injury. Small, 
moderate and large LRs (+LR range 1.39-40.81, -LR range 
0.02-0.52) were reported for Lachmans test across the stud- 
ies. Studies that report moderate or large LRs tended to be 
at risk of bias and had wide confidence intervals. One study 
[29] investigated the prone Lachmans test and reported 
small and imprecise LRs (+LR 3.50, -LR 0.38). 

Five studies [23,24,30,31,34], all at risk of bias, evalu- 
ated the accuracy of the pivot shift test. Small, moderate 
and large +LRs (range 4.37-16.42) and small -LRs (range 
0.38-0.84) were reported for the pivot shift test in all 
studies. Accuracy estimates with moderate and large 
+LRs tended to lack precision. 



Five studies at high risk of bias [26,28,30,31,33] inves- 
tigated physical tests when examination was performed 
under anaesthesia (EUA) (Additional file 6: Figure S3). 
The anterior draw test, Lachmans test and pivot shift test 
appear to provide improved diagnostic accuracy when 
examination is performed under anaesthesia. While LRs are 
moderate-large the confidence intervals around the +LR 
estimates are wide. 

Only one study, from the primary care setting with 
low risk of bias, provided data on the effect of combin- 
ing clinical tests [19]. Specifically, this included two or 
three positive history tests (from a list of popping sensa- 
tion, giving way, effusion, immediate pain at trauma and 
continuation of activity impossible) as well as a positive 
anterior draw or Lachmans test (Figure 3). The addition 
of a positive anterior draw test to the combinations of 
two positive history tests increase the +LR (4.81) close to 
moderate diagnostic threshold. The addition of a third 
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Table 2 Risk of bias and applicability concerns summary based on the QUADAS-2 checklist 



Risk of bias 



Applicability concerns concerns 



First author, year 



Patient selection Index test Reference standard Flow and timing Patient selection Index test Reference 

standard 



Beldame, 2011 [23] ? 


+ 


+ 




+ 


+ 


+ 


Boeree, 1991 [24] 


7 


+ 


+ 


+ 


+ 


+ 


Decker, 1988 [33] ? 


+ 


7 




+ 


+ 


+ 


Harilainen, 1987 [25] + 


7 


7 




+ 


+ 


+ 


Katz, 1 986 [26] 


+ 


7 


+ 


+ 


+ 


+ 


Lee, 1988 [27] 


+ 


7 




+ 


+ 


+ 


Lucie, 1984 [28] 


+ 


7 




+ 


+ 


+ 


Mulligan, 2011 [29] + 


+ 


7 


7 


+ 


+ 


+ 


Noyes, 1980 [30] 


7 


+ 




7 


+ 


+ 


Richter, 1996 [34] 


+ 


7 


+ 


+ 


+ 


+ 


Schwartz, 1997 [35] 


+ 


+ 


+ 


+ 


+ 


+ 


Tonino, 1986 [31] 


+ 


7 


7 


+ 


+ 


+ 


Wagemakers, 2010 [19] + 


+ 


+ 


+ 


+ 


+ 


+ 


Wong, 1999 [32] + 


7 


7 


+ 


+ 


+ 


+ 


Judgements on risk of bias and applicability concerns: - 


- high risk; ? = 


unclear risk; + = low risk. 









history test produced a large but imprecise +LR (35.64) 
but reduced the -LR (0.82). 

Discussion 

This systematic review included 14 studies that evalu- 
ated the diagnostic accuracy of clinical tests for ACL in- 
jury. Just one study, which was the only study performed 
in primary care, had a low risk of bias and showed that re- 
sults of individual tests produce only small changes in the 
probability of ACL injury. The same study investigated the 
diagnostic accuracy of combining history items with phys- 
ical tests and reported improved accuracy when doing so. 
The other studies, performed in secondary contact settings, 
had moderate to high risk of bias and reported quite di- 
verse and imprecise estimates of diagnostic accuracy. Based 
upon these findings, clinical tests in combination, but not 
individually, may assist the diagnosis of ACL injury. 

The key strengths of the review include a pre-specified 
and registered review protocol, the use of inclusion criteria 
to ensure that the study settings reflected clinical practice 
and the evaluation of study quality using the QUADAS-2 
checklist. This review also reported likelihood ratios as they 
are the preferred approach to report estimates of diagnostic 
accuracy [21]. The limitations of the study were that sparse 
data were available on most clinical tests and that we were 
unable to perform a meta-analysis due to heterogeneity in 
the estimates of diagnostic accuracy, risk of bias and clinical 
characteristics. The heterogeneity among studies is well il- 
lustrated by the results for Lachmans test, where reported 
+LRs ranged from 1.5 to 102, risk of bias varied and ACL 
injury prevalence in the included studies ranged from 21% 
to 81%. 



The clinical tests reviewed are those most commonly 
used for the diagnosis of ACL injury in clinical practice. 
Our findings suggest that a clinician cannot rely on a 
single clinical test in isolation, particularly one from the 
clinical history, to identify patients with ACL injury. Due 
to the fact that diagnostic decisions regarding ACL in- 
jury are not made on the basis of a single test, studies 
should ideally focus on test performance in combination. 
The best estimates of diagnostic accuracy come from 
Wagemakers et al. [19] whose data suggest that there 
may be some potential in combining clinical tests, spe- 
cifically the anterior draw test with two or three of the 
foUowing five history findings: popping sensation, giving 
way, effusion, immediate pain at trauma and inability to 
continue activity. Notwithstanding, these findings must 
be interpreted with caution as a major drawback of 
Wagemakers et als study was its low power to suffi- 
ciently analyse multiple combined tests. An important 
direction for future research is identification of the opti- 
mal combination of currently avaflable clinical tests to 
accurately diagnose ACL injury. While the literature re- 
garding the accuracy of currently used tests is of variable 
quality, those identified in this body of literature (and in- 
cluded in this review) are the logical candidates to inves- 
tigate using more robust methods. Such studies are weU 
suited to primary care settings (limiting referral bias), 
but sample sizes will need to be substantially larger than 
studies to date in order to investigate multiple sequen- 
cing of index tests. 

In contrast to our findings, previous systematic re- 
views have concluded that individual clinical tests can be 
used to accurately diagnose ACL injury [11,14]. The 
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Index test 

First Author 
Popping 

Wagemakers, 2010 

Wagemakers, 2010 

Noyes, 1980 
Giving way 

Wagemakers, 2010 

Wagemakers, 2010 

Noyes, 1980 
Effusion 

Wagemakers, 2010 

Wagemakers, 2010 

Noyes, 1980* 

Noyes, 1980^ 
Pain 

Wagemakers, 2010* 

Wagemakers, 2010* 

Noyes, 1980^ 

Noyes, 1980 " 

Noyes, 1980^ 
Continue activity 

Wagemakers, 2010 

Wagemakers, 2010 

Noyes, 1980 
Anterior draw 

Wagemakers, 2010 

Wagemakers, 2010 

Beldame, 2011 

Beldame, 2011 

Richter, 1996 

Boeree, 1991 

Tonino, 1986 

Lee, 1988 

Lachman 

Wagemakers, 2010 
Wagemakers, 2010 
Schwarz, 1997 
Beldame, 2011 
Beldame, 2011 
Boeree, 1991 
Richter, 1996 
Wong, 1999 
Harilainen, 1987 
Tonino, 1986 
Lee, 1988 

Prone Lacliman 

Mulligan, 2011 
Pivot shift 

Beldame, 2011 
Beldame, 2011 
Noyes, 1980 
Noyes, 1980 
Tonino, 1986 
Boeree, 1991 
Richter, 1996 



Risl< of Bias 

12 3 4 



Lil<elihood ratio 



0.02 

—I 



0.51(0.31 


0.85) 


2.3(1.5-3.6) 


0.51 (0.26 


0.98) 


2.1(1.4-3.4) 


0.95 (0.65 


1.40) 


1.1(0.5-2.3) 


0.63 (0.39 
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(See figure on previous page.) 

Figure 2 Diagnostic accuracy of clinical examination for ACL injury. Legend: Risl< of bias judgements: (-) = liigli risl<; (?) = unclear risl<; (+) = 
low risk. LR thresholds: +LR <5 and -LR >0.2 = small; +LR 5-10 and; -LR 0.1-0.2 = moderate and +LR >10 and -LR <1 = large. Studies that reported 
estimates for complete ACL injury as well as partial and complete ACL injury estimates have been plotted together to provide a comparison of 
test performance. Different symbols are used for the estimates for complete versus partial and complete ACL injury and for primary care versus 
secondary contact settings. *joint effusion 2 hours; tjoint effusion 12 hours; ^immediate pain at trauma; §pain none to slight; ||pain moderate to 
severe; Hguarded or painful ROM 24 hours after injury. Guide for interpretation: Greater distance between the -LR and +LR symbols for the test 
indicates better diagnostic performance. 



difference in conclusions is primarily because we only 
included studies evaluating a clinical sample with diag- 
nostic uncertainty. Other reviews have included case- 
control studies, a study design which has been shown 
to inflate estimates of diagnostic accuracy [36]. Our 
decision to interpret test accuracy via clinically usefully 
thresholds of likelihood ratios also distinguishes this 
from previous reviews. A final point of difference con- 
cerns our decision not to pool accuracy estimates, 
which we believe this is the only appropriate course 
given the risk of bias and heterogeneity evident in the 
included studies. 

Although we applied a critical approach to study selec- 
tion we still identified several methodological issues that 
affect internal validity and may result in overestimation 
of diagnostic test accuracy [17,37]. The spectrum of pa- 
tients in the included studies varied because of different 
methods in patient sampling. Most obviously, the char- 
acteristics of the samples varied due to the differences in 
study inclusion and exclusion criteria. Two recent pro- 
spective cohort studies illustrate this: Wagemakers et al. 
[19] included participants with new knee symptoms and 



excluded participants who were suspected of knee frac- 
ture; whereas Beldame et al. [23] included participants 
with indication for knee arthroscopy, meaning the sam- 
ple was subject to referral filter bias [37]. The paucity of 
diagnostic studies for ACL injury conducted in primary 
care also suggests caution should be taken when general- 
ising these findings to this setting. 

In some instances the index tests were not applied to 
all participants prior to the application of the reference 
test, or the reference test was performed without a clin- 
ical test. There was under reporting of reasons for pa- 
tient exclusion and withdrawals. Reporting was deficient 
in most primary studies which limited our appraisal of 
study quality. This is perhaps most evident with respect 
to risk of bias domains associated with blinding of the 
index tests and reference standards. Where multiple 
index tests were applied concurrently it is unclear the 
extent to which knowledge of prior testing (test review 
bias) overestimated the accuracy of index tests. Similarly, 
there was concern that the invasive nature of knee arth- 
roscopy or surgery as a reference test may have affected 
the flow of participants through some studies. In these 
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instances a patient with a negative index test may not 
have received a reference test creating partial verification 
bias. 

Conclusion 

This systematic review of clinical tests for ACL injury in- 
corporates the most recent knowledge of diagnostic test 
accuracy methods. The findings highlight the lack of 
clinical test accuracy data to support the use of history 
and physical examination to diagnose ACL injury. Most 
diagnostic studies on this topic contain methodological 
flaws which can overestimate the diagnostic accuracy of 
clinical tests. The available high quality evidence sug- 
gests that tests are not useful on their own but combina- 
tions may prove to be more useful. 
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